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ABSTRACT 

Weak lensing measurements of cluster masses are necessary for calibrating mass- 
observable relations (MORs) to investigate the growth of structure and the properties 
of dark energy. However, the measured cluster shear signal varies at fixed mass Af 2 oom 
due to inherent ellipticity of background galaxies, intervening structures along the 
line of sight, and variations in the cluster structure due to scatter in concentrations, 
asphericity and substructure. We use -/V-body simulated halos to derive and evalu- 
ate a weak lensing circular aperture mass measurement M ap that minimizes the mass 
estimate variance ((Map — M200m) 2 } in the presence of all these forms of variability. 
Depending on halo mass and observational conditions, the resulting mass estimator 
improves on M ap filters optimized for circular NFW-profile clusters in the presence of 
uncorrelated large scale structure (LSS) about as much as the latter improve on an es- 
timator that only minimizes the influence of shape noise. Optimizing for uncorrelated 
LSS while ignoring the variation of internal cluster structure puts too much weight on 
the profile near the cores of halos, and under some circumstances can even be worse 
than not accounting for LSS at all. We briefly discuss the impact of variability in clus- 
ter structure and correlated structures on the design and performance of weak lensing 
surveys intended to calibrate cluster MORs. 

Key words: gravitational lensing: weak - galaxies: clusters: general - cosmology: 
observations 



1 INTRODUCTION 

Counts of dark matter halos and their clustering prop- 
erties as a function of halo mass and redshift are a 
useful tool of observational cosmology, allowing both 
the determination of parameters of ACDM cosmology 
jWang fc SteinhardtJ Il998l : iHolder et all l200ll) and con- 
strain ts on the physics of Dark Energy ( Weller et alj 
120021) in a way that i s complementary to other probes 
l|Takada fc Bridle! 120071 : ICunha et al.ll2009l ). Upcoming and 
ongoing surveys such as the Dark Energy S urvey (DES) 
|The Dark Energy Survey Collaboration! [2005), or the At- 
acama Cosmology Telescope (ACT) and South Pole Tele- 
scope (SPT) surveys will make such measurements of the 
non-linear growth of structure. 

The detection of clusters and measurement of their 
masses can be performed using various signals, such 
as galax y density and the abundance of red seq uence 
galaxies (|Kepner et al l 1 19991 : Idadders fc Yeej [2000), the 
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Sunyaev-Zel'dovich (SZ) effect (jSunvaev fc Zeldovichlll972l : 
lHaiman et al.ll200ll : [Battve fc Weller]l2003l) or measurements 
of the cluster X-r a y emission (e.g. iBoehringer et al.l 120041 : 
ISahlen et all 120091 : [pTffaretti et all |2010| ). In principle, SZ 
or X-Ray survey s can use a self-calibrated mass-obs ervable 
relation (MOR) (|HuI|2003| : uVlaiumdar fc Mohrll2004l ). How- 
ever, constraints on cosmological parameters can be greatly 
improved if the MOR can be determined independently. 
Weak gravitiational lensing has therefore become an im- 
portant tool for finding the MOR of these methods (e.g . 
I Allen et al.ll2002l : lOkabe et alj|20ld : iHoekstra et al.ll201ll ). 
since the lensing signal is most closely related to the true 
mass of a cluster. As massive objects leave their imprint on 
space-time, their presence is revealed by the weak distortion 
of images of galaxies behind them at small angular separa- 
tions, allowing for reconstruction of their masses, which can 
then be used to calibrate other mass estimators. A recently 
developed alternative weak lensing approach is shear peak 
statistics, which can be used to constrain cosmology based 
on the lensing signal of the projected matter density and 
without the requirement for individual halo estimates of the 
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virial mass (IMarian fc Bernstein! r2006l ; IMarian et ail |2009|; 
iDietrich fc Hartladl2010t l. 

Halos can be detected with weak lensing for instance 
by applying inver sion of the shear signal for generating con- 
vergence k maps jKaiser fc Sauiresl|l995 ; ISchneider fc Seit3 
ll995l ; ISeitz fc Schneidedll995l , ll996U200lf ) or t he application 
of an appropri ate filter (see Section 12.21 and Kaiser 1995; 
ISchneider|[l996h, from which cluster candidates can be iden- 
tified fe.g. lSchirmer et al.|[2007l ; lMivazaki et al.ll2007l ) . Clus- 
ters detected in this way (or through baryonic signatures) 
can have mass estimates refined, depending on the strength 
of their lensing signal, by strong lensing mass profile re- 
constructions, by applying linear filters to the shear field, 
and/or by fitting a profile to the shear values observed. 

While strong lensing is compl ementary and only pos- 
sible in relatively rare cases (e.g. lAbdelsalam et all 1 19981 ; 
iBradac et alj 120051 ; lHalkola et alj|2006h . the last two meth- 
ods are generally applicable ways of using the lensing sig- 
nal. We will assume that weak-lensing mass estimators op- 
erate on the azimuthally averaged tangential reduced shear 
profile, gt(8), which suffers from several sources of noise. 
For one thing, background galaxies are not intrinsically 
round, adding white noise to the shear measurement (shape 
noise) inversely proportional to the background galaxy den- 
sity. Intrinsic alignments of background galaxies with re- 
spect to each other might introduce non-zero covariance 
betw een gt at different radii and must be treated care- 
fully ( Mandclb aum et al.ll200ft ). but we expect the intrinsic- 
alignment signal to be small compared to other forms of 
variance, so we will not consider it further. Secondly, any 
uncorrelated structures along the line of sight between 
observer and background sources will introduce an addi- 
tional weak lensing signal. While the expectation value of 
gt from uncorrelated structures is zero, its variance and co- 
variance must be taken into account when designing an op- 
timal method of weak lensing meas urements increase the 
uncertainty of weak lensing masses l|Hoekstra|[200ll . 120031 ; 
Irloekstra et aljfeoill ). Thirdly, dark matter halos are more 
likely to occur in overdense regions, where the probability 
of the presence of additional halos is higher than elsewhere. 
This can cause a signal with non-zero expectation value and 
signific ant variance that d isturbs weak lensing mass esti- 
mates l|Metzler et al.ll200lf ). 

The first two of these effects have been addressed in 
several ways. Shape noise is relatively well-controllable and 
deeper images with a larger number of background sources 
can at least in principle reduce it to a level that does not 
limit the intended scientific application, essentially allow- 
ing for a more complete and clean detection of less mas- 
sive halos. Intrinsic alignments ca n be accounted for when 
redshift information is pr e sent (iKing fc Schneider! 120031 : 
iHevmans fc Heavens! 120031 : IJoachimi fc Schneider! l2008h . 
The influence of shear covariance due to uncorrelated struc- 
tures on mass estimates can be accounted for when con- 
structing a fil ter to measure the amplitude of a known true 
shear profile jDodelsonll2003 ; iMaturi et alj|2005h . 

What has not been accounted for, so far, is the vari- 
ability of the cluster profile itself due to correlated halos and 
internal structure. The lensing signal of correlated halos can- 
not be discriminated from the cluster signal itself and, after 
appropriate calibration of the mass estimate, remains as a 
source of noise. This might be particularly severe as corre- 



lated halos align along a cosmic web. There is more to intrin- 
sic variability, however, than correlated halos. Profile fitting 
methods and optimized linear-filter mass estimators assume 
the presence of a true shear profile, typically the character- 
istic shear signature of Navarro- Frenk- White (NFW) profile 
dark matter halos jNavarro et all 1 19961 : iBartelmannl 1 199(11 : 
IWright fc Brainerdll200oir The average profile of dark mat- 
ter halos as measured from simulations or with weak lens- 
ing is consistent with the N FW profile (|Mandelbaum et al.l 
120061 : IJohnston etafl 120071 ) although th ere are claims for 
small deviations in the core (IMerritt et al.ll2006l ) and at large 
radii (|Qguri fc Hamanall20lj r What is certainly not true, 
however, is that individual halos follow spherical NFW pro- 
files with concentrat ions that only depe nd on their masses. 
Asphericity of halos (|Warren et al 11993 ) can lead to an over- 
estimation of the mass of clusters aligned with the line of 
sight and an underestimation for those aligned perpendicu- 
lar to it, esp ecially when fitting NFW shear profiles to ob- 
served shear jClowe et al.ll2004l : ICorless fc King||2007f ). Lin- 
ear filters, even those adapted to different mass ranges, will 
suffer from the width of the distribution of conc entration 
parameters at a given mass fcf. iBullock et al . 2001, for mea- 
surements of the concentration distributions). Fitting for 
concentration as a second parameter, however, introduces 
the problem of degeneracy with the primary measurement 
of mass. Additionally, dark matter halos contain substruc- 
ture, different amounts of baryonic matter and correlated 
structure outside the virial radius that induce variance in 
the actual profiles. When constructing a filter that yields a 
minimum variance mass estimate, the variance of true shear 
profiles could be taken into consideration as an additional 
component of the noise if there was an appropriate analytic 
model. The variety of effects involved, some of which are not 
entirely understood, makes this very difficult, however. 

We subsume all profile variability due to the effects 
mentioned in the previous paragraph under the term cor- 
related structures in what follows. We take them into 
account by using cosmological simulations of dark matter 
halos, which include all kinds of correlated structures. Since 
we know the true M2oom masses of these simulated halos as 
well as the reduced shear profile gt(8) including all correlated 
structures, we can derive the linear function M ap of gt that 



yields the minimum variance ((Map — M21 



) in the lens- 



ing estimation of the cluster virial masses. Thus, we drop 
the usual assumption of a universal shear profile, find the 
effect on cluster mass reconstruction, and re-optimize mass 
estimators taking the correlated structures into account. By 
taking into account not only shape noise and uncorrelated 
LSS, but also correlated structures, we give a full estimate 
of the scatter in lensing mass estimates. 

The structure of this paper is as follows. In Section 2 
we discuss our methodology for finding a minimum variance 
filter in the presence of realistic cluster profiles and uncor- 
related LSS. In Section 3 we investigate the radial depen- 
dence of halo profile variability and calculate and analyze 
the minimum variance filter for the simulated halos. In the 
final Section 4 we discuss our results and implications for 
the design of weak lensing cluster filters in future surveys. 
The Appendix contains details of the halo model formalism 
used to calculate the covariance matrix of the LSS lensing 
signal. 
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2 METHODS 
2.1 Simulations 

We use a simulation that is p art of the one anal- 
ysed by iBecker fc Kravtsovl (|201Ch . It was made using 
1024 3 dark matter particles of mass 6.98 ■ 10 10 h~ 1 M & 
in a box of comoving size 1 Gpcfr" 1 with a paral- 
lelized Adaptive Refinement Tree algorithm (jKravtsov et al.l 
ll997l : lGottloeber fc Klvpinll2008h . It is simulated from red- 
shift z — 60 to 2 = with cosmological parameters 
(Sl m ,Q b ,<T8,h,n) = (0.27,0.044,0.79,0.7,0.95) at an effec- 
tive spacial reso lution of 30 kpcfe - 1 . The same simula- 
tion was used in iTinker et all £2008), where it was labeled 
L1000W. We use a snapshot at redshift Zd = 0.245, in which 
14,856 halos of M 20 om > 1 ■ 1O 14 M were identified, 24 of 
which are heavier than M2oom = 1 • 10 15 Mq. 

For the selected halos, the mass is integrated along the 
line of sight and the lensing signal is determined on a grid 
of approximately 40 comoving kpc /pixel using th e Born 
approximation as described in IBecker fc Kravtsovl l|2010l . 
Section 3), taking into account all matter within comov- 
ing ±200/i~ Mpc along the line of sight and, transversely, 
in a square with comoving side length 20/i~ 1 Mpc centered 
on the cluster. We assume background sources to be at a 
constant redshift z s = 1, which at our Zd = 0.245 scales 
gravitational shear by a factor of 0.76 compared to sources 
at infinity. We extract from this tangential reduced shears 
in radial bins beginning at ro = 1' and having a width 
n e [(1 - A) ■ n, (1 + A) ■ n] with A = 0.1. Shear in- 
side r — 1' (approximately 160 kpc proper radius) is subject 
to resolution effects and biased low because of smoothing, 
which is why we discard it for our analysis. Since the to- 
tal weight of the excluded region is low, our results do not 
change significantly if the signal inside r — 1' is included. 



2.2 Minimum Variance Filter 

A useful approach of measuring halo masses is to use a filter 
u(0) that weights the projected ma ss density within an aper- 
ture (|Kaiserlll995l ; ISc"hneiderl ll996i). Let for all the following 
g, 7 and k be the azimuthally averaged g t , jt and R at the 
respective radius. The aperture mass within an aperture u 
is defined as 



M ap = 2tt I d6 6 u(6)k(0) 



A compensated aperture, fulfilling 

o = dee u(0) , 



(1) 



(2) 



is insensitive to mass sheets. The compensated aperture 
mass can be expressed equivalently in ter ms of tangential 
shear s 7t, which are related to k as (see ISchneider et al.l 
l200d . Eqn. 24, for this azimuthally averaged relation) 



7«(0) = <«(O)(«'<«) -*(*)• 

With a corresponding shear filter q-y(9), 

M ap = 2tt / de e g 7 (0h*(0) • 



(3) 



(4) 



The tangential shear weight function q 7 can be calculated 
from the projected mass density weight function u via 



9t(0)= 02 J de' e' u (e') - u(e) 

and vice versa, using Eqn. [3l as 

u{6) = 2 r dd 'iip.- q ^e), 



(•») 



(6) 



which is by definition compensated. 

Since the observable is the reduced shear g = 7/(1 — ft) 
M a p must in practice be determined as 



M ap = 2tt dee q(0)g(0) 



(7) 



with reduced shear weights q(e). 

We will assume that the integral [7] will be executed as 
a sum over measured average tangential reduced shears gj 
in a finite number of annuli, weighted as 



N 

Map = V Qm = g T Q , 



=1 



(8) 



where Qj is the weight for the average tangential reduced 
shear gj in annulus j. We wish to find the Qj which minimize 
the mean squared error of the lensing mass estimator relative 
to M2oom, the mass enclosed in a sphere of 200 times the 
mean matter density: 



2 

Cm 



((M a 



M 2 , 



o 2 ). 



(9) 



Most previous optimizations have assumed that the 
shape of the mass profile is known a priori, such that 
gj — M2oom<7trucj + 6j, where ej is a measurement noise 
with zero mean and known variance. In this case, the filter 
Q that minimizes a\ t is the same as the (Wiener) filter that 
maximizes signal-to-noise ratio of M ap . When the ej are un- 
corrected between diff erent annuli (as for shap e noise), the 
Wiener filter is (e.g. iMarian fc Bernsteinll2006h 



Qj oc gtruc, j/Var(tj). 



(10) 



Such filters can be constructed analytically for widely used 
dark matter halo mass profiles, such as the NFW pro- 
file dNavarro et all 1 19961). where the shear is given by 
iBartelmannl (|l996l ) and IWright fc Brainerdl (|200Ch . The fil- 
ter derived under these assumptions — a cluster shear profile 
assumed to follow the NFW form, with known concentra- 
tion, and measurement noise due entirely to shape noise — we 
will henceforth denote as the "S filter." 

If the measurement noise has non-zero covariance be- 
tween annuli, as is the case if we consider shear from un- 
corrected LSS to be part of the noise, then we can define 
a covariance matrix via Cjk ~ (ej€k), and we can generalize 
equation (|10l) to 



Q oc C 1 g tr 



(11) 



We will denote as the "S+U" filter the Q derived with these 
assumptions of known NFW shear profile with measurement 
noise from the combination of shape noise plus uncorrelated 
projected large-scale structures. 

Equation (|11|) is not optimal, however, when there is 
variability in the shear profiles of halos. Consider the re- 
duced shear signal of simulated cluster i, 1 ^ i ^ n in annu- 
lus j, 1 ^ j ^ N to be given as the element gij of a matrix 
g. Since the simulation does not include shape noise or LSS 
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outside a certain integration region, we assume that mea- 
surements will incur an additional noise Ej with covariance 
matrix C that is independent of the cluster properties. For 
filter Q the mass estimation variance becomes 

na 2 M = (M - gQ) T (M - gQ) + nQ T CQ, (12) 

where M is the vector of true masses M2oom . The minimum 
variance weights can be found from Eqn. I12l as 

-1 ~T , 



Qo = {fg + nCy 



M . (13) 

These minimum variance weights are not necessarily unbi- 
ased, but typically systematically underestimate the mass 
slightly. We rather minimize the variance under the con- 
straint of no bias, i.e. (Mi) — Q ■ (g) = 0, where (g) is the 
mean reduced shear of the sample, which we add to Eqn. 1121 
with a Lagrange multiplier. The resulting minimum variance 
filter among unbiased linear filters is 



( Mi) - ((gg o)<) 
"g + nCy^g). 



Q = Qo + AQ * (W) 
where AQ = 

This filter will be denoted as the l 'S + U + C" or the 
"optimal linear" filter, since it incorporates correlated struc- 
tures, and has the lowest possible a\j of any linear operation 
applied to the azimuthally averaged shear profiles of realistic 
simulated halos. 

In the remainder of the paper we will examine the prop- 
erties and performance of the optimal linear filter, and the 
degradation that correlated structure induces on the sim- 
pler 5* and S + U filters. Note that all three of the filters 
discussed here depend upon the value of M2oom for the clus- 
ter: the scale radius of the NFW profile assumed in the S and 
S + U filter derivations is typically mass-dependent; and the 
g matrix in the S + U + C filter also depends upon the mass 
range of clusters taken from the simulation for the optimiza- 
tion. In practice, of course, the cluster mass is not known 
in advance, and some kind of iterative procedure would be 
needed to choose a filter that is (approximately) optimized 
for each cluster's mass. We will leave treatment of this iter- 
ative procedure to future work, and in this paper simplify 
the task by assigning the simulated clusters to bins based 
on their known M2oom values, and apply a single filter to 
each cluster-mass bin. Also, we only optimize filters for one 
redshift of Zd = 0.245. While the method we develop can be 
used as a general prescription for optimizing mass measure- 
ments, the change in shear amplitudes, angular scales and 
correlated structure of clusters requires that the filters be 
recalibrated for each individual redshift. 

2.3 Uncorrelated Large Scale Structure 

The presence of large scale structure (LSS) between sources 
and observer that is uncorrelated to the observed cluster is 
an additional source of noise that must be accounted for 
when finding op t imal filters for wea k lensing measurements 
jDodelsorJ 12004 ; iMaturi et al1l2005h . We use a halo model 
approach to calculate the lensing covariance matrix of LSS 
that is outside the ±200/i -1 Mpc region integrated in the 
simulation. We assume that structures beyond 200ft - 1 Mpc 
are uncorrelated with the target cluster. The halo model 
also allows to exclude shear noise from uncorrelated halos 
with Miaom > 10 14 AfQ. The rationale for this is that if 



these massive halos were present along the line of sight to 
a target cluster, they would be detected independently, and 
the interference could be modelled or the target cluster could 
be discarded from the sample. For details of the calculation, 
we refer the reader to Appendix IA1 



3 RESULTS 

3.1 Variability of Halo Profiles 

We begin by investigating the mean and variance of halo 
shear profiles. Taking the average of the simulated cluster 
shear profiles, we find that both globally and for narrow 
mass bins it is possible to fit a NFW profile to the cluster 
with deviations from the average shear that are typically 
within 5-10% of the signal. The individual profiles, however, 
show a great variety of structures. These must be interpreted 
as due to variations in concentration, halo ellipticity, and 
mass in the form of substructure and correlated second ha- 
los. 

In narrow mass bins, we find the covariance matrix of 
the shear signal in our radial bins with respect to the em- 
pirical mean. We estimate the expected covariance matrix 
from uncorrelated LSS within the ±200/i _1 Mpc integration 
region of the simulations using the halo model approach de- 
scribed in Appendix rS] and subtract this from the covari- 
ance "observed" in the simulation to yield a matrix that can 
be fully attributed to internal halo variation and correlated 
halos. Figure [1] shows the dispersion of shears in relation 
to the mean signal as a function of radius for an example 
mass bin. The overplotted dispersions due to shape noise of 
different levels and uncorrelated LSS between background 
sources at z = 1 and observer show that the intrinsic vari- 
ance has its highest relative importance inside and at the 
virial radius, 2-7', inside of which the shape noise and out- 
side of which the noise of uncorrelated LSS increase more 
steeply than the intrinsic variance. In this range, the intrin- 
sic dispersion of the profile is close to proportional to the 
mean shear signal. Note that the intrinsic variance and LSS 
are correlated between radial bins, unlike the shape noise, 
so this plot somewhat over-represents the relative impact of 
shape noise on mass estimation. 

Figure [5] plots the intrinsic dispersion of shears due to 
correlated structures in relation to the mean signal along 
with some expected contributions to this. The dotted line 
plots the dispersion in g t expected from the variance of con- 
centration _values_ibr_the best- fit NFW profiles to the halos. 
We follow [Bullock et al.l (|200ll ) and take a log-normal distri- 



bution of concentrations with <xi D 



0.18. The dot- 



ted line also includes the range of M2oom variation within 
the bin, but because the mass bin is narrow, the variation of 
concentrations dominates this component. The concentra- 
tion variation expl ains most of the shear variance at r ~ 1' 
(in agreement with lReed et al.ll201ll ). but is subdominant at 
larger radii. The overdensity of correlated neighboring ha- 
los introduces additional variance, which we calculate using 
a halo model by assuming a Poisson distribution of mutu- 
ally independent halos (see Appendix). This neighbor- halo 
shot noise becomes more important towards the outskirts. 
These two components, however, are not enough to explain 
the entire variability of shear signals. Additional contribu- 
tions that are difficult to estimate analytically stem from 
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Figure 1. Relative shear dispersion due to shape noise of three 
different levels (dotted lines for 10, 40 and 100 background galax- 
ies per arcmin 2 ), uncorrelated LSS along the line of sight between 
z s = 1 and the observer (dashed) and intrinsic profile variance 
due to correlated structures (solid line). The latter is calculated 
for halos with a narrow range of masses (1.8 . . . 2.0T0 14 Mq) with 
respect to their mean shear signal. 

the aspherical halo profile, the distribution of neighboring 
halos, and substructure of halos, for instance due to recent 
mergers. Simulations remain the most viable way of assess- 
ing these. 



3.2 Improvements in Mass Uncertainty 

We compare here the variance in halo masses obtained with 
three different linear filters Q for the tangential shear profile: 

(i) The S filter, which is the Wiener optimal filter for 
measuring halos that are assumed to have the mean (NFW) 
reduced shear profile of the individual mass bin, with uncer- 
tainty from shape noise only, as per Equation (jTUJ ; 

(ii) The S + U Wiener filter that in addition accounts for 
shear variance from uncorrelated LSS, as per Equation 

and 

(iii) Our minimum-variance S+U+C filter that takes into 
account the full variance including correlated structures and 
variation of internal halo structure, as per Equation (1131) . 

Although these filters were derived under different assump- 
tions about sources of shear variance, they are always eval- 
uated taking into account the full noise in the lensing signal 
due to shape noise, plus uncorrelated LSS, plus the corre- 
lated structure as measured in the TV-body simulations. We 
consider shape noise levels corresponding to an intrinsic el- 
lipticity dispersion of <r e = 0.3 and three survey depths of 
10, 40 and 100 background galaxies per arcmin 2 . We calcu- 
late the uncertainty of mass estimates based on the different 
filters as 



Figure 2. Relative variations of halo shear profiles (solid line) in 
a narrow range of masses (1.8 .. . 2.0 ■ lO 14 A/ ) and contributions 
to this from the expected distribution of profiles from variations 
in concentration with Clog 10 c 2 oom = 0.18 and masses (dotted) 
and Poisson noise in excess density of second halos due to linear 
two-point correlation (dashed). The variance of shear profiles has 
contributions from uncorrelated LSS within the integration region 
of the simulations subtracted. 

omIM = \j <(Q ■ 9i - Mi) 2 ) + Q T CQ/{M l ) , (15) 

where gi is the vector of radially binned tangential reduced 
shears for halo i extracted from the simulations without 
shape noise and without LSS outside the integration region, 
Mi is the true M2oom mass of halo i as measured from the 
simulations and C is the covariance matrix due to shape 
noise and uncorrelated LSS outside the integration region. 
In order to find the dependence on cluster mass we split the 
sample into narrow mass bins with a population of at least 
700 halos each and find ctm/M for each mass bin at three 
different levels of shape noise. 

Figures [3] plot cjm /M from Eqn. [TS] versus Mzoom for 
the three different filters at each survey depth. We further 
compare the result to the naive prediction 

ctm/M = \Jq t CQ/{M 1 ) (16) 

where Q is the S + U filter optimized for the mean shear pro- 
file of the bin in the presence of uncorrelated LSS and shape 
noise. This is the mass uncertainty one would expect if all 
clusters were true NFW halos with no scatter in concentra- 
tion c(M, z) at constant mass and redshift, and no correlated 
structure was present. Statistical la errors in <tm /M due to 
the limited size of our sample of halos were estimated with 
a jackknife approach (i.e. by leaving out one halo from the 
analysis in turns) and are always below 0.01. 
The following results are of interest: 

• The S + U + C filter has the lowest mass estimate vari- 
ance, which is true by definition. 
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Figure 3. Relative errors in estimates of M200m for different 
aperture mass shear filters, when applied to simulated TV-body 
halos at z = 0.245. All symbols show measurement errors in- 
cluding shape noise, uncorrelated LSS, halo profile variation, and 
correlated structures. Each panel shows RMS errors from: the S 
filter, optimized for NFW clusters with shape noise only (skeletal 
symbols); S + U filters optimized for NFW clusters accounting for 
uncorrelated LSS as well as shape noise (open); and the S + U + C 
minimum variance filters (solid symbols) optimized with the in- 
clusion of halo profile variation and correlated structures. Also 
plotted is the prediction for S + U filters in the unrealistic case 
that all halos have the same profile with no correlated structures 
(solid line). 



• The uncertainty of the mass estimates with the S + U 
filter predicted for the naive case of no correlated structure 
is always significantly lower than any real uncertainty. 

• Due to variance in the intrinsic profile, which can be 
mitigated neither by deeper data nor by heavier clusters, 
there is a lower limit to mass uncertainty. Deeper surveys 
benefit mass estimates remarkably little, especially for rel- 
atively massive halos. Observations beyond a background 
galaxy density of 40 arcmin -2 are not notably improving 
mass estimates for the mass range of halos and redshift 
z c i = 0.245 considered here, and will only benefit cases where 
shape noise is more dominant, i.e. less massive clusters and 
objects at higher redshift. 

• On the high signal-to-noise end of massive halos and 
low shape noise, the S + U filter that is optimized for uncor- 
related LSS and shape noise, but not for intrinsic variability 
of profiles, yields a mass estimate that is no improvement 
over the S filter without taking LSS into account. 

• On realistic ground-based data with a background 
galaxy density of 10 arcmin 2 , the improvement from taking 
into account intrinsic profile variability is very small since 
the S + U and S + U + C filters are almost identical and the 
large shape noise in the center prevents the S + U filter from 
assigning too much weight to the core. The increase in mass 
uncertainty from the naive prediction, however, remains rel- 
evant even at these observational conditions. 

Can a non-linear fitting process improve upon our opti- 
mal linear shear estimates of Maoom? For comparison, we 
note that for two-dimensional fitting of an NFW profile 
with M2oom and C2oom as free parameters, a sample with 
halos of mass Af2oom > 4 ■ 10 14 Mq from the same simu- 
lation has om/ M = 0.4 (0.27) for shape noise levels of 10 
(40) arcmin -2 ( Becker fc Kravtsovll2010l ). We do not have 
results for the exact corresponding mass range and the meth- 
ods differ in that we pre-sort halos into true mass bins. How- 
ever, the comparison of mass uncertainties suggests that the 
two-parameter non-linear fit does not yield significant im- 
provements. This is potentially because the contribution of 
concentration variation to the intrinsic variability of profiles 
is subdominant (cf. Section [3. ip and a degeneracy between 
mass and concentration exists. 



3.3 A Look at Optimized Filters 

We briefly examine the optimal linear filters found by our 
procedure. These take into account the covariance due to un- 
correlated LSS along the line of sight and shape noise while 
minimizing the variance of the mass estimate empirically by 
using the simulated cluster shear profiles. Figure[4]compares 
these S + U + C weights to the S and S + U filters optimized 
for NFW-profile clusters. 

The optimized filters differ from the NFW-optimized fil- 
ters in that they do not put as much weight to the innermost 
region, shifting the maximum of q ■ r to r ~ 5' for the mass 
bin 1 ... 2 • 10 14 Mq. In terms of weights on convergence u, 
this puts more weight on projected density at intermediate 
distance from the core. Consequently, the convergence filter 
u compensates at larger radii. All filters that take LSS into 
account, however, do put less weight on the shear signal out- 
side r w 10', where the LSS signal becomes comparable to 
or larger than shape noise. 
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Figure 4. Fitted weights for different levels of shape noise (circu- 
lar, triangular and square symbols for 10, 40 and 100 background 
galaxies per arcmin 2 , respectively). Plotted for comparison are 
NFW Wiener filters for shape noise only (solid line) and for LSS 
and the three levels of shape noise (long dashed, short dashed and 
dotted lines). The two panels show tangential shear weights and 
convergence weights transformed according to the transformation 
of reduced shear weights q(9) into <J 7 (0) cj(t9)/(l — k(6)) and 
application of Eqn. \E\ 



This suggests an explanation for why the S + U filter 
that includes uncorrelated LSS can perform worse than the 
S filter optimized only for shape noise: the S+U filter places 
higher weight on the intrinsically variable central region, 
generating noise that outweighs the gain from putting less 
weight to the outer regions where uncorrelated LSS domi- 
nates the errors. 



Schirmer et all il2007l ). iMandelbaum et all (|2010t ) and 
iHoekstra et all ( 20111 ) discuss various reasons for the high 
uncertainty of shear measurements near the core and the 
benefits of downweighting the shear signal in that region. 
Some of the effects mentioned are not present in our simu- 
lations, such as baryonic effects, signal dilution due to clus- 
ter member galaxies (which are most prominent near the 
center), and magnification- induced change and incomplete 
sampling of the redshift distribution of sources when no or 
only uncertain single source redshift estimates are present. 
In practical application, this would degrade the performance 
of S + U filters even more since they weight the central re- 
gions more heavily. 



4 SUMMARY 

By the linear least squares method outlined in Section 12.21 
we have found filters for the tangential shear signal of sim- 
ulated halos that minimize the variance between mass es- 
timates and true M2oom of the clusters in the presence of 
shape noise, uncorrelated projected LSS and the intrinsic 
variability of halo shear profiles. We have shown in Sec- 
tion 13.21 that taking into account the last source of noise 
can improve upon the mass estimates of filters that assume 
a constant NFW profile. This improvement is comparable 
to the improvement that has been previously noted as re- 
sulting from the inclusion of uncorrelated LSS in the filter 
optimization, compared to optimizing solely for shape noise 
at M ^ 1.5 • 10 14 A#q and a background galaxy density of 
40arcmin~ 2 or more. The most obvious difference from fil- 
ters that do not take intrinsic profile variability into account 
is a suppression of weight on the central region of the halo 
(cf . Section 13 - 3 p , where intrinsic variability of profiles has 
its most important contribution to the overall noise (cf. Sec- 
tion [3TTJ. 

Apart from these observations, this has a number of im- 
portant conclusions for weak lensing measurements of clus- 
ter masses: 

• Even when accounting for shape noise and noise due 
to uncorrelated LSS, the uncertainties of an aperture mass 
measurement will be substantially underestimated if we ne- 
glect the intrinsic variability of halo profiles. The intrinsic 
variability of halo shear profiles sets a lower limit to the ac- 
curacy of lensing mass estimations which, especially for low 
shape noise and relatively massive halos, can be a multi- 
ple of what would be expected for fixed halo profiles. This 
has to be accounted for when calibrating the MOR, even 
more so as the errors in mass estimators must be known 
to g reat accuracy for cluster count probes to be success- 
ful dLima fc Hu|[2005). It also means that, at least with 
this method of estimating cluster masses with weak lens- 
ing, there is limited benefit in making ever deeper observa- 
tions, because uncertainty in the signal remains both in the 
core (because of intrinsic variability) and in the outer parts 
(because of LSS) of halos. For more massive halos, the prob- 
lem persists since at least some components of the intrinsic 
variability of the profile (e.g. concentration distribution, as- 
phericity, substructure) are likely to scale with the overall 
profile amplitude. Ground based surveys and even more so 
space-based surveys and deep space-based observations will 
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have to take this into account when predicting the accu- 
racy of their MOR calibration as a function of survey depth 
and width. Deeper lensing surveys will show advantage only 
when calibrating clusters at redshifts well away from the 
z = 0.245 studied herein. 

• The intrinsic variations in shear profiles can be ex- 
plained by variations i n concentration only in the core re- 
gions of halos (cf. also lReed et al.ll201ll ). It is important to 
take into account as well the variations of profiles due to 
substructure and correlated structures, which dominate the 
intrinsic profile variability near and outside the virial radius. 

• When, in addition to shape noise, a perture mass fil- 
ters account for u ncorrelated LSS only (|Dodelsonl |2004| ; 
iMaturi et ai1l2005h . the uncertainty of mass estimates can 
actually become larger. This is a result of the intrinsic profile 
variability near the core of the halo, where LSS optimized 
filters put more weight than would a filter optimized for 
the influence of both uncorrelated LSS and intrinsic vari- 
ability. We expect optimization for uncorrelated LSS to be 
non-beneficial for clusters of mass M 20 om ~ 3.0 ■ 10 14 M Q at 
the simulated redshift of z — 0.245 and background galaxy 
levels above 10 arcmin - . 
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APPENDIX A: HALO MODEL 

In the following we describe the halo model used to cal- 
culate the covariance of shear in annuli due to structure 
uncorrelated to the central halo and outside the integra- 
tion region of the simulations. For a general review of 
ha lo models of large scal e structure, we refer the reader 
to ICoorav fc Shethl (|2002h . Approaches similar to ours are 



presented in Coorav et al.l (|2000l . in Fourier space) and 
iTakada fc Jainl (|2003l ." in real space). 

Consider a set of halos 1, . . . , i, . . . , n with < Zi < z s = 
1. The tangential shear 7 t fe they introduce in an annulus k is 



(Al) 



where hi = (9i $ i, 62,1, Zi, Mi, a, . . .) is a set of parameters 
describing the position and mass profile of halo i and -yt(h-i) 
is the tangential shear such a halo introduces on annulus k. 

We can then straightforwardly calculate the covariance 
of 7^ and 7] in annuli k and I as 

^70 = /e^)-E^A = 



£(7?(fci)-7i(fc))) + (X! 



(it {hi) 



•7] fo)) ) ,(A2) 



where the average goes over random realizations of the halo 
populations. Note that we have separated the terms caused 
by the shear of a single halo on both annuli (one-halo term) 
from correlations caused by pairs of halos (two-halo term). 
The latter will not be zero, because halo positions are cor- 
related. 

The average in Eqn. IA2l can be expressed as an integral 
over probabilities dPi(h) of finding a halo with properties 
h and dP2(h±, /12) of simultaneously finding two halos with 
properties hi and /12, i.e. 



Cov(7f,7J) 



dP^^w-^Kh) 



+ / dP 2 (/ ll ,h 2 ) 7t fc (/ii)- 7] (h a ) 



(A3) 



The probability density dPi(h) of finding a halo with 
properties h has as its cosmological ingredients the halo 
mass function, -^(M, z), and the distribution of halo den- 
sity profiles. We can write 



dPAh) = — dM dQ. dz p(c\M, z) dc . 

dAI d\l dz 



(A4) 



assuming NFW halo profiles and a probability density 
p(c\M, z) of finding concentration parameters c = C2oom = 
rwom/rs which only depends on the halo mass M = Mwo m 
and redshift z. Note that we use definitions of r2oom, C200m, 
and M200771 with respect to 200 times the mean matter 
density at the halo's redshift. We assume as a fixed mass- 
concen tration relation the best-fit for mula from Duffy et al.l 
(2008) (using WMAP5 cosmology, cf. lKomatsu et al.l l2009). 



c(M, 



10.14 ■ 



M 



2- 10 12 M(T 



© 



(1 



(A5) 



For the halo mass function and the mass dependent biases 
b(M) used below, we ado pt the Sheth- Tormen m ass function 
and the associated bias ijSheth fc TormerJI 19991 ). 

The combined probability density dP2(h±, h.2) of simul- 
taneously finding two halos with properties h± and h? must 
include the interdependence of their probabilities. Assume 
that halos cluster according to a 3d two-point correlation 
function £hh(r, hi, /12) which is generally dependent on halo 
properties /ii/ 2 in addition to their separation r. We can 
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then write 



dP 2 (h 1 ,h 2 ) = 
dPi(hi) -dPi(fca) ■ (l+&fc(f»i,ha)) 



(A6) 



Because the tangential shear due to a randomly placed halo 
does not correlate with the shear due to another randomly 
placed halo, the only contribution to the covariance stems 
from the term proportional to £hh, and we can drop the 
other part for the numerical integration. In our halo model, 
what contributes to the correlation of halo positions is the 
linear evolution of structures. The non-linear evolution, i.e. 
the formation of halos themselves, need not be considered 
to this end, as it is dominant only on scales smaller than the 
ones considered here. The h alo-halo correlation function £hh 
can therefore be written as (jGiocoli et al.l l2010) 



&h(r, Mi, M 2 , z) « 6(Mi, z) ■ b(M 2 , z) ■ &Jr, z) 



(A7) 



giving the excess probability of finding a pair of halos with 
masses Mi and M2 at comoving separation r at redshift 
2 ~ 21 ~ 22. We use a linear power spectrum that is con- 
sistent with WMAP 5 results. We relate this to an angular 
correlation at redshift 2 by performing one redshift-space 
integral over the three-dimensional two-point correlation at 
a fixed angular separation and setting the redshift of two 
correlated halos to be equal for all other purposes. 

We calculate the shear signal of off-center halos on an- 
nuli 7* (h) by integrating the surface density of the halo 
inside and on the annulus and applying Eqn. [3] We take 
into account the width A = 0.1 of our annuli k with 
r 6 [(1 — A) • Tk, (1 + A) ■ r&] by calculating and tabulating 

,(1 + A)r / r« ( 

RdR ( J ° -K{R,h) I , (A8) 

'(l-A)r 



2Ar 2 



(-yt{x, ft.))(i-A).r k <x<(l+A)rfc 

J" fl fc(a:, h)xdx 
WJ2 



where j t (x,h) and n(x,h) are the average tangential shear 
and convergence a halo with properties h introduces to an 
annulus of radius x. The integral is numerically not more de- 
manding than the simpler one required for zero- width rings. 
As a simplification for numerical integrati on, we truncate the 
projected surface density of NFW halos (|Bartelmannll 19961 ; 
IWright fc~ Braincrd 2000) at a radius r trU nc = 20-r s > r2oom, 
which does not change the signal significantly. 

When doing the final integrations over dz to get the co- 
variance matrices, we leave out the range 0.162 < 2 < 0.338 
as it is already included in the simulations. We limit the mass 
range of halos included to 1O 8 M < M 20 om < 1O 14 M . 
This lowers the overall variance because of the lack of more 
massive halos, which can be expected to b e modelled sep- 
arate ly in real-world weak lensing analyses (Ho ekstra et al.l 

Hon]). 

We apply the approximation in which the uncorrelated 
LSS noise can simply be added to the reduced shear sig- 
nal of the clusters. Formally taking into account the effect 
of multiple lensing planes on reduced shears requires more 
complex calculations. 



Results 

Shear dispersions as a function of radius compared with 
shape noise for our binning scheme are shown in Figure |A"T1 
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Figure Al. Dispersion and covariances of tangential shear signal 
due to uncorrelated LSS. Dispersions (upper panel) are plotted for 
the complete redshift range (filled) and when removing contribu- 
tions from the range of the numerical simulations (open symbols) 
and compared to different levels of shape noise (lines). Covari- 
ances (lower panel) are plotted with respect to three different 
radii, corresponding to the three different symbols. 



together with covariances between the signal at different 
radii. The inner peak is dominated by the one-halo term 
while the slowly decreasing outer range is mostly due to two- 
halo contributions. The r esults from our c alculation agree 
with those presented in iHoekstral l|200lf ); iHoekstra et al.l 
l|201ll ). 
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Poisson Covariance due to Correlated Halos 

In a similar manner, we find the variance of the shear signal 
due to the spherical excess density of second halos around 
the halo under consideration. The shear signal in annulus k 
due to this can be expressed as in Eqn. IA1I where the sum- 
mation runs over correlated halos. Consequently, the shear 
covariance due to correlated halos 

Cov c ( 7t fc , 7 J) = J (dPx(h\h ) - dPi(fo)) i k t {h) ■ yl(h) , (A9) 

can be determined from the excess probability of halos with 
properties h given the presence of a central halo with prop- 
erties ho- The term dP\(h\ha) — dPi(h) is equal to Eqn. lA4l 
multiplied with (,hh(ho,hi). This approach makes the as- 
sumption that the probabilities of finding two second halos 
with properties hi and h.2 are mutually independent, which 
is why the covariance calculated in Eqn, IA9l is merely a lower 
limit to the true shear covariance due to clustering correlated 
second halos. 



